Unsupervised Acoustic Model Training for Simultaneous Lecture Translation in Incremental and Batch Mode

نویسندگان

  • Michael Heck
  • Satoshi Nakamura
چکیده

In this work the theoretical concepts of unsupervised acoustic model training and the application and evaluation of unsupervised training schemes are described. Experiments aiming at speaker adaptation via unsupervised training are conducted on the KIT lecture translator system. Evaluation takes place with respect to training e ciency and overall system performance in dependency of the available training data. Domain adaptation experiments are conducted on a system trained for European parliament plenary session speeches with help of unsupervised iterative batch training. Major focus is on transcription pre-processing methods and confidence measure based weighting and thresholding on word level for data selection. The objective is to lay the foundation for an unsupervised adaptation framework based on acoustic model training for use in KIT’s simultaneous speech-to-speech lecture translation system. Experimental results show, that it is of advantage to let the Viterbi algorithm during training decide which pronunciations to use and where to insert which noise words, instead of fixating these informations in the transcriptions. With weighting and thresholding it is possible to improve unsupervised training in all test cases. Tests of iterative incremental approaches show that potential performance gains strongly correlate to the performance of the baseline systems. Considerable performance gains are observable after only one iteration of unsupervised batch training with applied transcription pre-processing, weighting and thresholding.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incremental Unsupervised Training for University Lecture Recognition

In this paper we describe our work on unsupervised adaptation of the acoustic model of our simultaneous lecture translation system. We trained a speaker independent acoustic model, with which we produce automatic transcriptions of new lectures in order to improve the system for a specific lecturer. We compare our results against a model that was trained in a supervised way on an exact manual tr...

متن کامل

Robust Hmm Training and Adaptation in Hands-free Speech Recognition

In this paper a challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy office environment with either batch or incremental model adaptation. The application of a microphone array processing compensates only for part of the mismatch between training and testing acoustic conditions. In a previous work it was shown that the acoustic mismatch can be further re...

متن کامل

Incremental adaptation with VTS and joint adaptively trained systems

Recently adaptive training schemes using model based compensation approaches such as VTS and JUD have been proposed. Adaptive training allows the use of multi-environment training data whilst training a neutral, “clean”, acoustic model to be trained. This paper describes and assesses the advantages of using incremental, rather than batch, mode adaptation with these adaptively trained systems. I...

متن کامل

Incremental Training and Intentional Over-fitting of Word Alignment

We investigate two problems in word alignment for machine translation. First, we compare methods for incremental word alignment to save time for large-scale machine translation systems. Various methods of using existing word alignment models trained on a larger, general corpus for incrementally aligning smaller new corpora are compared. In addition, by training separate translation tables, we e...

متن کامل

Incremental Visual Behaviour Modelling

We develop a novel visual behaviour modelling approach that performs incremental and adaptive behaviour model learning for online abnormality detection. Three key features make our approach advantageous over previous ones: (1) unsupervised learning, (2) online and incremental model construction, and (3) model adaptation to changes in visual context. In particular, we formulate an incremental EM...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014